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AMENDMENTS TO THE SPECIFICATION 

1 . Please replace paragraph [0052] of United States Patent Application Publication 
No. 20050055201 with the following amended paragraph: 

[0052] Further, in one embodiment, if the receiver is operating on a traditional fixed- 
frame mode, the extra data in the buffer will help smooth eventual fluctuations in the 
transmission delay (i.e., delay jitter). For example, one embodiment of the speech 
onset detector with burst transmission is used in combination with a method for jitter 
control as described in a copending United States utility patent application entitled 
"A SYSTEM AND METHOD FOR REAL-TIME JITTER CONTROL AND PACKET- 
LOSS CONCEALMENT IN AN AUDIO SIGNAL," f il ed TBD, and aoo i gnod Sor. No. 
JBB now Application Serial No. 10/663.390 filed 15 September 2003 . the subject 
matter of which is hereby incorporated herein by this reference. 

2. Please replace paragraph [0056] of United States Patent Application Publication 
No. 20050055201 with the following amended paragraph: 

[0056] Further, in one embodiment described with respect to the receiver side of a 
communications system, if the receiver is operating in a variable playout schedule, 
then it dynamically adjusts the delay by compressing or stretching the data in the 
receiver buffer, as necessary. In particular, this embodiment is described in a 
copending United States utility patent application entitled "A SYSTEM AND 
METHOD FOR PROVIDING HIGH-QUALITY STRETCHING AND COMPRESSION 
OF A DIGITAL AUDIO SIGNAL," now Application Serial No. 10/660,325 filed Sep. 

1 0, 2003, and aoo i gnod Ser i a l No. TBD, the subject matter of which is hereby 

incorporated herein by this reference. 
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3. Please replace paragraph [0088] of United States Patent Application Publication 
No. 20050055201 with the following amended paragraph: 

[0088] where N is the frame size and A is the starting point of the voice frame. 
Then, the energy of the last known silence frame ES is computed in §3© 420 using 
a similar expression (and it is assumed to be smaller than EV). A threshold T is 
established in ii@ 430 with a value between EV and ES, for example by setting 

T=(4ES + EV)/5 Equation 2 

4. Please replace paragraph [0089] of United States Patent Application Publication 
No. 20050055201 with the following amended paragraph: 

[0089] Then, a number (or all) samples c, in the buffer are selected 440 to be tested 
as possible starting points (onset points) of the speech. For each candidate point, 
the energy level of a number of samples equivalent to a frame is computed, starting 
at the candidate point. In particular, for each candidate point q, an energy E(Ci) is 
computed 450 as by Equation 3: 

E(Ci ) = ^ (4Ci + n]f Equation 3 

«=0 

5. Please replace paragraph [0091] of United States Patent Application Publication 
No. 20050055201 with the following amended paragraph: 

[0091] Note that the simple example illustrated by FIG. 4 is provided for purposes of 
explanation only. Clearly, as should be appreciated by those skilled in the art, the 
processes described with respect to FIG. 4 are based only on a frame energy 
measure, and does not use zero-crossing, spectral information, or any other 
characteristics known to be useful in determining voice presence in a particular 
frame. Consequently, this information, zero-crossing, spectral information, etc., is 
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used in alternate embodiments for creating a more robust speech onset detection 
system. Further, other well known methods for determining speech onset points 
from a particular sample of frames may be used in additional embodiments. For 
example, such methods include looking for the inflection point in the spectral 
characteristics of the signal, as well as recursive, hierarchical search methods. 
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